A statistical approach to virtual cellular experiments: improved causal discovery using accumulation IDA (aIDA)
نویسندگان
چکیده
MOTIVATION We address the following question: Does inhibition of the expression of a gene X in a cellular assay affect the expression of another gene Y? Rather than inhibiting gene X experimentally, we aim at answering this question computationally using as the only input observational gene expression data. Recently, a new statistical algorithm called Intervention calculus when the Directed acyclic graph is Absent (IDA), has been proposed for this problem. For several biological systems, IDA has been shown to outcompete regression-based methods with respect to the number of true positives versus the number of false positives for the top 5000 predicted effects. Further improvements in the performance of IDA have been realized by stability selection, a resampling method wrapped around IDA that enhances the discovery of true causal effects. Nevertheless, the rate of false positive and false negative predictions is still unsatisfactorily high. RESULTS We introduce a new resampling approach for causal discovery called accumulation IDA (aIDA). We show that aIDA improves the performance of causal discoveries compared to existing variants of IDA on both simulated and real yeast data. The higher reliability of top causal effect predictions achieved by aIDA promises to increase the rate of success of wet lab intervention experiments for functional studies. AVAILABILITY AND IMPLEMENTATION R code for aIDA is available in the Supplementary material. CONTACT [email protected], [email protected]. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
منابع مشابه
Systems biology A statistical approach to virtual cellular experiments: improved causal discovery using accumulation IDA (aIDA)
Motivation: We address the following question: Does inhibition of the expression of a gene X in a cellular assay affect the expression of another gene Y? Rather than inhibiting gene X experimentally, we aim at answering this question computationally using as the only input observational gene expression data. Recently, a new statistical algorithm called Intervention calculus when the Directed ac...
متن کاملIntroducing Adaptive Incremental Dynamic Analysis: A New Tool for Linking Ground Motion Selection and Structural Response Assessment
Adaptive Incremental Dynamic Analysis (AIDA) is a novel ground motion selection scheme that adaptively changes the ground motion suites at different ground motion intensity levels to match hazardconsistent properties for structural response assessment. Incremental Dynamic Analysis (IDA), a current dynamic response history analysis practice in Performance-Based Earthquake Engineering (PBEE), use...
متن کاملImproving the Reliability of Causal Discovery from Small Data Sets using the Argumentation Framework
We address the problem of reliability of independence-based causal discovery algorithms that results from unreliable statistical independence tests. We model the problem as a knowledge base containing a set of independences that are related through the well-known Pearl's axioms. Statistical tests on finite data sets may result in errors in these tests and inconsistencies in the knowledge base. ...
متن کاملDiscovery of linear acyclic models in the presence of latent classes using ICA mixtures
Causal discovery is the task of finding plausible causal relationships from statistical data. Such methods rely on various assumptions about the data generating process to identify it from uncontrolled observations. We have recently proposed a causal discovery method based on independent component analysis (ICA) called LiNGAM, showing how to completely identify the data generating process under...
متن کاملA Multi-objective Optimization Model for Dynamic Virtual Cellular Manufacturing Systems
Companies and firms, nowadays, due to mounting competition and product diversity, seek to apply virtual cellular manufacturing systems to reduce production costs and improve quality of the products. In addition, as a result of rapid advancement of technology and the reduction of product life cycle, production systems have turned towards dynamic production environments. Dynamic cellular manufact...
متن کامل